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Abstract 

Recently we reported on an application of the Tsallis non-extensive statistics to the S&P500 
stock index . There we argued that the statistics are applicable to a broad range of markets and 
exchanges where anamolous (super) diffusion and 'heavy' tails of the distribution are present, as 
they are in the S&:P500. We have characterized the statistics of the underlying security as non- 
extensive, and now we seek to generalize to the non-extensive statistics the excess demand models 
of investors that drive the price formation in a market. 
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In the past decade, there have been many models (P, 0) proposed that attempt to capture 
the dynamics and statistics of market participants that drive the price changes in a market. 
These range from minority game models, multi-agent models, and spin models of the bias 
of investors. They all have in common the fact that they are models for the excess demand. 
The instantaneous excess demand (f){t) can be defined as the mismatch in the number of 
buyers N+{t) and sellers N^{t) for a number of shares at time t. The hallmark for the 
success of an interacting investors market model has been the ability of a given model in 
reproducing the stylized facts of real markets. These are the heavy tails (power-law) of the 
distributions, anomalous (super) diffusion, and therefore statistical dependence (long-range 
correlations) of subsequent price changes (0). 

Recently the Tsallis non-extensive statistics were applied to analyzing the price changes 
and intra-day statistical dynamics of the S&P500 stock index ([Q). This study characterized 
the statistics of the price changes as being well-modeled by the non-extensive statistics. It 
was also argued that the statistics are applicable to a broad range of markets and exchanges 
where anamolous (super) diffusion and power-law tails of the distribution are present, as 
found in the S&P500 {[§). 

In this paper we examine the demand-side in light of these recent findings and outline a 
method by which one can obtain many-investor models within the context of the Tsallis non- 
extensive statistics using the maximum entropy approach. We review the maximum entropy 
approach to be utilized. The non-extensive, least-biased probability density (PDF) P{z, t) 
of an underlying (say, continuous) observable z{t) is obtained by maximizing an incomplete 
information-theoretic measure equivalent to the Tsallis entropy Sq ( [|l^, |l3l) and subject to 
the known (or assumed) observables of the system as constraints 

{s)q = s, = -Y^^{i-J P{z,tydxy 

{z)q = j zP{z,tydx. (1) 

The maximization of the entropy with a a Lagrange multiplier is 

5(5,)-5[a(2)J = 0, (2) 

and the maximization yields the least biased probability distribution given the constraints. 
In the nonextensive statistics this is a g-parametrized power-law Tsallis distribution (||10||) 



and q is the degree of non-extensivity or equivalently the incompleteness of the information 
measure. In the hmit of g — > 1 we recover the Gibbs-Boltzmann extensive statistics and 
the distribution becomes an exponential. The normalization is Zq{t), the partition function, 
and a{t) is a possibly time-dependent Lagrange multiplier associated with the constraints. 
Again, the constraints will be the known (or assumed) observables of interest, and are 
presumed to capture the statistical behavior of our many-investor system. 

We initially wish to model the fluctuating variables of the number of investors N±{t) 
buying and selling (— ) in the market. We write the excess demand as (f){t) — N+{t) — 
N_{t) and note that it is proportional to the instantaneous price x{t). That is, we can write 
approximately 

.(t) = M . (4) 
A A 

where A is the market depth, or the excess demand needed to move the price by one dollar. 
The market depth is assumed to evolve slowly during the time scales considered, and will 

be taken to be a constant. The observables of interest arc the means and variances of the 
fluctuating variables. We can write the entropy and observables as statistical averages about 
the means 



" \ N+,N^=0 

N 

((Ar+-7V+)) = 0= E {N+-N+)P{N+,N_,ty, 

^ N+,N-=0 
N 

^ N+,N-=0 
N 

N.-N.)) =0= E {N--N_) P{N+,N_,ty, 

^ N+,N.=0 
N 

N_-N.y) = E (N^-N.)' P{N^,N.,ty =e{t\, (5) 

^ N+,N_=0 

N 

subject to the additional constraint of normalization 1 = J2 Pi^+i N_,t) . We then 

Ar+,Ar_=o 

seek the least biased probability distribution given these observables. We maximize the 
entropy given the constraints 

S {S), - S [p{t)[{{N^ - N^r)^ + ((7V_ - N-r)j] ^ 0, (6) 
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and here f3{t) is a time dependent Lagrangian multipher and can be shown to be proportional 
to the inverse of the variance [jl|. The maximization yields the power-law least biased 
probability distribution 

N 

where Zq = J2 is the partition function and is related to the normal- 

N+,N.=0 

ization. We can therefore obtain all the observables of interest by performing statistical 
averages with respect to the distribution. The evaluation of the Lagrange multiplier(s) 
proceeds as in the extensive statistics case ( |]12| ) 

= "(iw, + >?-(*),. (8) 

and in the limit as 5 ^ 1 we recover the standard extensive statistics expression — = 

The demand-side model then provides us with a statistical description of the fluctuating 
variables (A^+, A^-), the difference of which can be related to the price by x{t) = . 
This model has the power-law behavior of real markets as a consequence of the pseudo- 
additive nature of the entropy of the subsystems. That is to say, the number of investors 
buying and selling at time t in the 'subsystems' of buyers and sellers are not statistically 
independent, and the incremental changes in time of the number of buyers and sellers will be 
correlated in time. It is also known that the price changes dx{t) exhibit anomalous diffusion 
and correlations in time and therefore by dx{t) = so will the change in the excess 
demand d(p{t). The composition of the entropy of the two subsystems must be written from 
the joint probability decomposition (suppressing the time parameter) 

P(A^+, N_) = P{N+ I N_)P{N_) (9) 

which gives the pseudo-additive entropy 

SqiN+, N^) = S,(iV_) + Sq{N+ I iV_) + (1 - q)Sq{N.)S,iN+ \ N.), 

pi-1 _ 1 

S, = -lnqP = . (10) 

I — q 

It can be shown (|]14[) that the Tsallis entropy satisfies this condition, and the resulting 
probability will be of the power-law form derived above. 



In order to simplify the solution, let us symmetrize the range of the distribution. We define 
n± = N± — Y such that the variable n± now has the symmetrical range — ^ ^ ''^± ^ y ^^^d 
the excess demand becomes (f) = — n_. We next pass to the continuum limit of the number 
of investors. Alternatively, we could have started our derivation from this approximation. 
Also, due to the observation that most trades involve a small fraction of the overall number 
of investors ( N± — N± ^ A^) we relax the range to iV ^ oo. The distribution P{n^, t) 
is now taken to be symmetric and continuous, and is given by a similar power-law form as 
before 

P(n,, t) ^ (1 + - ') - + - . (U) 

Zq 

We integrate to obtain the partition function 

oo oo 

Zq{t) = [ [ P{n+,n_,t)dn+dn_ (12) 

(13) 



-oo — oo 

vr 



/?(t)(2-g)' 

where the range of q must now be restricted to 1 < < 2 to insure normalization. We 
note that (3{t)Zq{t) = Cg , a time independent constant of the process dependent on the 
non-extensivity parameter q. This is useful in that we can relate the Lagrange multiplier 
parameter (3{t) (and therefore the inverse variance) to the partition function for all times 
considered. The range of q is chosen as 1 < g < qmax consistent with the requirements that 
the distribution is normalizable for all times and that the regular variance remain finite. 
The regular variances rj'^it) can be computed readily from the distribution 

oo oo 

>± - n.r-) (n± - n.mn., n..t)d„,d„. (14) 



— OO — oo 
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2 Pit) (3-2g)' 

and the range of q must be further restricted to 1 < g < 1.5 to assure convergence of the 
integral and therefore a finite variance. 

The power-law distribution P{nj^,n-,t) has a time evolution that satisfies a two dimen- 
sional Fokker-Planck partial differential equation ( |jlO|, |lll), and we assume a linear drift 
term a{n±) = a± n± consistent with the linear drift of the price distribution (|T|]) 
d d d 



This subsequently implies the underlying stochastic differential equations ( 0) for the num- 
ber of buyers and sellers 



drij^ = a{n+)dt + ^DP^-i{nj^,n_,t) dW+{t), 

dn_ = a{n_)dt+ ^DP^-i{n+,n^,t) dW^{t). (16) 

Here dW±{t) = C±{t)dt , where W±(t) is the Wiener process and ^±(t) is a delta correlated 
noise such that {^ait) Co-it')) = ^aa'^it — t')- If we examine the change in the excess demand 
we obtain the relationship to the change of price 

d(j) = dn+ — dn_ 
= A dx{t) 

The solutions of these SDE's can be obtained by using Ito's change of variables and per- 
forming the integration over time 

t 

n±{t) = ri±(to)e"±(*-*°) + J e'^±(*-*')^DFi-'/(n+, n_,t)dt', (17) 

to 

and we thus have the desired result for the formation of the instantaneous excess demand 
(p{t), and therefore the instantaneous price x{t) 

cf>{t) = n+{t)-n^{t) 

= A x{t). (18) 

We note that the solutions of these stochastic equations depends on the knowledge of the 
time evolution of the probability P(?7,+ , n_, t). One must then solve the Fokker-Planck 
equation simultaneously with the SDE's. Equivalently one can obtain the time evolution of 
(3(t) analytically and therefore the evolution of P{n^,n^,t) as outlined in (Jll). 

The stochastic differential equations (SDEs) in Eq.([T6|) can be seen to be a non-extensive 
SDE generalizations of the Cont and Bouchaud model ( 0), in the continuum approxima- 
tion. Moreover, they are of the statistical feedback form (|jl|, |^), in that the microscopic 
stochastic dynamics are coupled to the macroscopic probability distribution P(?t,+ , t). 
The microscopic stochastic equations and the macroscopic Fokker-Planck equation are 
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known to be equivalent descriptions of a given random process. In this case of anomalous 
diffusion of the excess demand, the statistical dependence of subsequent demand changes 
are explicitly represented in the SDE. Hence this statistical model of the excess demand re- 
produces the stylized facts of real markets, namely power-law distributions and anomalous 
diffusion as a consequence of the pseudo-additivity of the system entropy. 

We have in this article generalized a model of excess demand and price formation to the 
case of the non-extensive statistics of C. Tsallis. We state the need to go beyond the extensive 
Gibbs-Boltzmann statistics in modeling real markets from the point of view of the pseudo- 
additivity of the entropy of statistically dependent subsystems, and the decomposition of 
the non-factorizable joint probability. This approach to statistically dependent subsystems 
and statistically correlated variables has been applied directly to the price statistics of real 
markets ^ ^, ^) and shows promise in modeling the stylized facts of these markets, 

such as the power-law behavior and anomalous diffusion of price changes. We are working 
to examine this model given data sets of investor demand and market depths from a real 
market (such as the S&P500) that is known to exhibit the anomalous diffusion and power- 
law behavior of price changes. We are also recasting this model of investor demand in 
terms of spins (bias), and moving beyond this number statistics model to an interacting 
many-investor spin model of markets. 

M.D. Johnson and F. Michael acknowledge support from the NSF through grant number 
DMR99-72683. 
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